HFS File Structure
Volume Number: 1
Issue Number: 12
Column Tag: Ask Prof. Mac
HFS File Structure Explained
By Steve Brecher, Software Supply, MacTutor Contributing Editor
Anonymous Questions
Q. I know you like to give credit to readers who send questions. But if I prefer
that you not identify me, will you honor my request?
A. Yes. In particular, don't be afraid to ask "dumb" questions!
DS versus DC
Q. Loftus Becker of Hartford, CT asks how one should choose between DS and DC
for data storage in assembly-language programs. Is it a question of style, or are there
reasons of substance to use sometimes one, sometimes the other?
A. The MDS Assembler DS directive reserves global variable storage that the
program will address with an offset from the A5 register. The linker calculates the
offset of each variable declared with DS, and substitutes the offset where you use the
symbolic name of the variable. The linker also puts the total size of the global area
into the CODE 0 resource, so than the Segment Loader can allocate sufficient space
"below A5" when the application is started. DS stands for "define storage.
The DC directive allocates static space within your code segment, at the place
where the directive appears. The assembler will generate the value(s) given as
arguments to DC, which stands for "define constant." DC provides a way to assemble
arbitrary data instead of instructions. The assembler generates PC-relative addressing
for references to DC data.
DS global storage is not initialized when the application is loaded; DC data is
initialized.
DS variables may be directly written with a MOVE instruction, e.g.,
Move #1234,MyDSVar(A5)
DC storage cannot be the direct destination of a MOVE instruction, because the
68000 does not allow the PC-relative addressing mode for the destination of a MOVE.
To alter DC data, its address must first be placed in a register, and then it can be
written to using register indirect addressing, e.g.,
Lea MyDCData,A0
Move #1234,(A0)
DS storage is always resident while your application is running. DC data is local
to the segment in which it is assembled, and goes away if the segment is purged.
DS storage does not use disk space within your application file; DC storage does
take up disk space.
In sum, DS is best used for program variables, while DC is best used for
constants.
SFGetFile Unmounts
Q. I'm writing a desk accessory that shows information about all online volumes.
It also allows the user to change file attributes. I've noticed that sometimes when I put
up an SFGetFile dialog box to allow the user to choose a file, after SFGetFile returns my
DA does not show all the volumes that were there (in the VCB queue) prior to the
SFGetFile. Why?
A. One of the last things SFGetFile does after the user clicks Open or Cancel is to
unmount any online volume that is not in a drive. I'd guess the reason it does this is to
avoid cluttering the VCB queue (list of online volumes) with disks that the user
inserted and then ejected while the SFGetFile dialog box was up. A user searching for a
file could possibly insert and eject a large number of disks. SFGetFile's volume purge
routine does not discriminate among offline volumes which were mounted previously
and those which were mounted while its dialog was active.
PRECs: Correction
In an answer last month, the format of a PREC resource was given incorrectly. A
PREC resource specifies a selection of printer paper sizes. The correct format is:
n [1-word count] -- number of active paper sizes
6 pairs of words -- vertical,horizontal paper size in 120ths of an inch
6 Pascal-format strings -- names of paper sizes
The first word of the resource is a count ranging up to 6 of the number of paper
sizes that are "active," i.e., meaningful. The first "count" pairs of size words and the
first "count" names are "active"; the remaining pairs of size words and name fields, if
any, are unused.
SystemTask within DA
Q. If some code in a Desk Accessory is time-consuming, can a call to SystemTask
be placed in it? I have tried doing this with no apparent harm, but I'm concerned about
re-entrancy problems.
A. SystemTask is a routine generally called in the main event loop of
applications as a "courtesy" to desk accessories and device drivers that need to have
their Control routines entered at periodic intervals. A DA or driver indicates such a
need by setting the dNeedTime bit in the flags word in its header.
SystemTask uses a semaphore variable in low memory to enable it to ignore
recursive calls to itself. Also, it will not generate a Control call to a driver that is
currently executing. So calling SystemTask from within a DA or driver should be OK,
and you needn't worry about re-entrancy because of it.
HFS Preview
We temporarily interrupt our Q&A format to bring you a preview of HFS, the
new Hierarchical File System that is being delivered with Apple's 20MB disk.
Third-party disk vendors either have already implemented or are working on HFS
compatibility.
HFS -- known as TFS (Turbo File System) prior to release -- replaces MFS,
the original Macintosh File System. It is, in effect, a new File Manager. Along with it
comes a new Finder. However, Apple has gone to great pains to make the new system
upwardly compatible with existing applications and existing disks. All MFS file
system calls are supported, and HFS can handle MFS disks.
HFS implements true nested subdirectories in the file system itself. Under MFS,
a pseudo-subdirectory scheme was simulated by Finder folders -- but only within
Finder, and at great expense in Finder execution time. Under HFS, the Finder's desktop
looks much the same, but now the folders are real subdirectories. The new Finder
does have an addtional cosmetic command in the View menu, "by Small Icon," that
causes files to be identified by reduced-size icons with the filename to the right of each
(see Figure 1). This option permits many more files to fit within a given Finder
window; it's available for MFS disks as well as HFS disks.
With HFS, it's no longer necessary to partition hard disks into pseudo- volume
subsets. Instead, users can arrange their files into folders (directories) so that many
files -- thousands -- can co-exist manageably on one disk volume. Directories can
contain directories as well as files, making for a tree file-catalog structure. Two or
more files on a disk can have the same name, as long as each is in a different directory.
HFS also uses a different scheme to allocate and keep track of used/unused space
on a volume so that the minimum file allocation size can be smaller -- 0.5K, as
opposed to 1K for MFS floppies or 2K-4K or more for MFS hard disks. Thus more
small files can fit on a disk.
The biggest change the user will see is in the new SFGetFile and SFPutFile dialogs.
Figures 2-4 show typical SFGetFile displays of the same disk whose Finder view is
shown in Figure 1. In Figure 2, the user sees the folders (and files, but there are none
in the example) of the volume's root directory. He opens Folder 1, and then sees the
dialog box shown in Figure 3. Folder 1 in this example also has only folders within it,
although it could contain files. The user opens Folder 2a, and sees the dialog shown in
Figure 4 which displays the four files and one folder that are the contents of Folder 2a.
The user still hasn't located his file, and he wants to go back up the directory tree. He
clicks on "Folder 2a" on top of the scroll box, and gets a pull down menu of all the
directories he has traversed back through and including the root (which has the same
name as the volume). By dragging the mouse down and releasing, just as for a normal
menu, the user can go back to any of the higher-level directories -- he isn't limited to
going back one level at a time.
The SFPutFile dialog box (not shown) has a new scroll box, like SFGetFile's, in
addition to the TextEdit item for entering a file name. It works like SFGetFile's, but it
shows only folders (directories), not files, since its only purpose is to enable
navigation among directories before specifying the name of a new file to be created.
Now let's turn to the programmer's view of HFS.
Each element of the file system catalog tree -- each directory or file -- is called
a Catalog node (Cnode). The directory at the base of the tree is called the root, and has
the same name as the volume. Intermediate nodes in the tree are always directories;
terminal (leaf) nodes -- those with no branches extending from them -- are either
empty directories or files.
Each Cnode has a name (such as "MacPaint Documents Folder" or "Letter to
Birks, Druffey, & Co."). Cnode names are strung together, with colons in between, to
form a pathname. Each element of a pathname except the last must be a directory; the
last may be either a directory or a file. A full pathname starts with the root directory
name, and describes an arbitrary path to a given Cnode; for example:
My HFS Disk:Folder 1:Folder 2a:Sources:Foo
A partial pathname is one that either starts with a colon or that consists entirely
of a single name, e.g,
:Folder 2a:Sources:Foo
Foo
An empty pathname element -- a pair of colons -- signifies a move up the tree,
i.e., it stands for the parent of the current location. Example:
My HFS Disk:Folder 1:Folder 2b::Folder 2a:Sources
Three colons signifies a move up two levels, four colons up three levels, etc.
Internally, each directory CNode has an identifying number -- a DirID. Cnodes
can be identified to HFS system routines by using a partial pathname, together with a
DirID that denotes a directory from which the partial pathname specifies a path to the
desired Cnode. Note that full pathnames may in some cases exceed 255 characters in
length and thus might not be respresentable as Pascal-format strings.
By using the new file system call OpenWD, the programmer can specify a
working directory. The call returns a WDRefNum which can be used in subsequent
calls to identify a Cnode. WDRefNums are negative, like VRefNums, but their values
won't overlap those of VRefNums. When a WDRefNum is passed to a file system routine
in place of a VRefNum, the system will look up the volume and the directory from the
associated Working Directory Control Block (WDCB) that was filled in by an OpenWD
call. The system creates a fixed pool of WDCBs at startup, and maintains its contents;
the programmer need not be concerned with WDCBs directly.
The working directory scheme is one of the keys to compatibility with MFS.
Under HFS, SFGetFile might return a WDRefNum in its reply record in the field in
which it returns a VRefNum under MFS. If the program then passes that value to an
_Open call, the desired file will be opened; the program needn't be conerned about nor
know whether SFGetFile returned a WDRefNum or a VRefNum.
Existing programs that pass volumename:filename strings to the File Manager
rather than using VRefNums will not be able to access HFS files (except those that are
in the root directory). This includes programs written in C which use stdio routines,
since those routines take only a string as a file identifier. (Specifically, it includes
the programs in the MDS package.) Such programs can be fixed by inserting a call to
SetVol between the call to SFGetFile and the call to fopen(). If SetVol is passed a
WDRefNum, it will set the default directory as well as volume. If permanently
changing the default volume/directory is not desireable, the previous default can be
obtained first via GetVol, and restored after the fopen() with another SetVol. C stdio
programs written in this way will work under both MFS and HFS.
Memory structures such as the Volume Control Block (VCB) and File Control
Block (FCB) have expanded. However, all the new information is in the expansion, so
that programs which access these structures under MFS will still find the old fields in
the same place. Existing programs that access the FCB buffer -- which is an array
rather than a linked list -- won't work, however (Apple has been warning developers
of this for some time).
As mentioned, the existing MFS File Manager calls are all supported under HFS.
In addition, there are some new File Manager routines. Some of the new routines are
extensions of the old ones which return additional information -- HFS-specific fields
-- in expanded I/O parameter blocks. Other new routines are similar to their MFS
counterparts except they accept an additional DirID field in the parameter block. The
rest of the new routines have no MFS counterparts; e.g., a routine to move a file or
directory from one directory to another on the same disk (does not physically move the
file, only changes the catalog); and the routines which open, close, and interrogate the
contents of WDCBs. The File Manager changes will be documented in a new chapter of
Inside Macintosh to be available soon (if not already available by the time you read
this).
The initial release of HFS is RAM-resident in high memory. The final release is
expected to reside in the new ROM (Rumored-Only Memory). HFS takes up additional
system heap space (e.g., for the WDCB pool and an enlarged trap dispatch table); and
the initial RAM version uses even more system heap space for system patches. The
system heap is larger, but even so it's relatively cramped; system heap hogs beware.